UCD-PN: Selecting General Paraphrases Using Conditional Probability

نویسندگان

  • Paul Nulty
  • Fintan Costello
چکیده

We describe a system which ranks humanprovided paraphrases of noun compounds, where the frequency with which a given paraphrase was provided by human volunteers is the gold standard for ranking. Our system assigns a score to a paraphrase of a given compound according to the number of times it has co-occurred with other paraphrases in the rest of the dataset. We use these co-occurrence statistics to compute conditional probabilities to estimate a sub-typing or Is-A relation between paraphrases. This method clusters together paraphrases which have similar meanings and also favours frequent, general paraphrases rather than infrequent paraphrases with more specific meanings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UCD-Goggle: A Hybrid System for Noun Compound Paraphrasing

This paper addresses the problem of ranking a list of paraphrases associated with a noun-noun compound as closely as possible to human raters (Butnariu et al., 2010). UCD-Goggle tackles this task using semantic knowledge learnt from the Google n-grams together with human-preferences for paraphrases mined from training data. Empirical evaluation shows that UCDGoggle achieves 0.432 Spearman corre...

متن کامل

Gender and the Factors Affecting Child Labor in Iran: an Application of IV-TOBIT Model

In this paper we first intend to examine the probability of falling into the realm of child labor by using conditional probability theorem. Furthermore, we will compare the extent of each factor’s effect on boys and girls using a TOBIT regression model. Finally we will analyze aspects of Iran’s labor market to assess the future ahead of the children who work at present. As the results will show...

متن کامل

General and Specific Paraphrases of Semantic Relations between Nouns Article (published Version) (refereed) Natural Language Engineering General and Specic Paraphrases of Semantic Relations between Nouns General and Specific Paraphrases of Semantic Relations between Nouns

Many English noun pairs suggest an almost limitless array of semantic interpretation. A fruit bowl might be described as a bowl for fruit, a bowl that contains fruit, a bowl for holding fruit, or even (perhaps in a modern sculpture class), a bowl made out of fruit. These interpretations vary in syntax, semantic denotation, plausibility, and level of semantic detail. For example, a headache pill...

متن کامل

Topic Models for Word Sense Disambiguation and Token-Based Idiom Detection

This paper presents a probabilistic model for sense disambiguation which chooses the best sense based on the conditional probability of sense paraphrases given a context. We use a topic model to decompose this conditional probability into two conditional probabilities with latent variables. We propose three different instantiations of the model for solving sense disambiguation problems with dif...

متن کامل

Manual and Automatic Paraphrases for MT Evaluation

Paraphrasing of reference translations has been shown to improve the correlation with human judgements in automatic evaluation of machine translation (MT) outputs. In this work, we present a new dataset for evaluating English-Czech translation based on automatic paraphrases. We compare this dataset with an existing set of manually created paraphrases and find that even automatic paraphrases can...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010